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In this paper, we present a new development of inspection games in a mean field setting. In our 
dynamic version of an inspection game, there is one inspector and a large number N interacting 
inspectees with a finite state space. By applying the mean field game methodology, we present 
a solution as an e-equilibrium to this type of inspection games, where e goes to 0 as N tends to 
infinity. In order to facilitate numerical analysis of this new type inspection game, we conduct 
an approximation analysis, that is we approximate the optimal Lipschitz continuous switching 
strategies by smooth switching strategies. We show that any approximating smooth switching 
strategy is also an e-equilibrium solution to the inspection game with a large and finite number N 
of inspectees with e being of order 1/N. 
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1 Introduction 


An inspection game is a non-cooperative game whose players are often called an inspector and an 
inspectee. It models a situation where the inspectee, which may be an individual, an organisation, 
a state or a country, is obliged to follow certain regulations but has an incentive to violate them. 
The inspector tries to minimise the impact of such violations by means of inspections that uncover 
them. 

A simple example of an inspection game can be described by the following 2x2 normal form 
game in Table [U where the inspectee is the row player and the inspector is the column player, 
and the left (resp. right) entry of each cell corresponds to the payoffs for the individual (resp. 
inspector). Typically an inspection game has a mixed equilibrium. 



Inspect 

Not Inspect 

Violate 

-LI 

2,-2 

Comply 

0,-1 

0,0 


Table 1: The simplest two-player inspection game 

Inspection games were introduced by Dresher (1962) and Kuhn(1963), the underlying motivation 
was the cold war between the US and the Soviet Union and the desire to monitor the various arms 
control agreements that were signed by the two superpowers. Analytically, these settings led quite 
naturally to two-person game formulations with various assumptions about the strategy sets that 
were feasible to each of the two parties. 

Inspection games have been investigated quite extensively during the last five decades. They 
have a wide variety of applications to name a few such as arms control by Avenhaus et al. (1996), 
auditing of accounts by Borch (1990), tax inspection by Greenberg (1984) and Aim and McKee 
(2004), environmental protection by Avenhaus (1994), quality control in supply chains by Reyniers 
and Tapiero (1995), Tapiero and Kogan (2007), Hsieh and Liu (2010), stock keeping by Fandel 
and Trockel (2008) and communication infrastructures by Gianini et al. (2013), Chung, Hollinger 
and Isler (2011). The research on inspection games contributes to the construction of an effective 
inspection policy for the inspector when an illegal action is executed strategically. 

In the literature, attentions are mainly on two-person zero-sum games, with some drifts to 
two-person non-zero-sums models. As far as we know, there are very few models with multiple 
inspectees. In the arms control inspection context, Kilgour and Averhaus (1994) considered a model 
with two or more inspectees, where the inspectees are independent and the inspector’s inspection 
is a binary variable, namely inspector decides whether to inspect or not. Later Avenhaus and 
Kilgour (2004) studied another model with two inspectees, where the inspector has a fixed level 
and continuously divisible inspection resources. They answered the question on how the inspector 
distributes efficiently its limited inspection resources over several independent inspectees. 

In a recent work on distributed information systems, Gianini et al. (2013) study a simultaneous 
one-shot inspection game with uncoordinated m inspectors and n non-interacting inspectees. In 
their model, an inspector has limited resources but the probability of detection is not a function of 
the resourcee. They show that due to the lack of coupling among inspectees, adding or removing 
inspectees does not change the best mixed strategy of one inspectee. 

Kolokoltsov, Passi and Yang (2013) develop a dynamic inspection game model with one in¬ 
spector and a large number of interacting inspectees in an evolutionary setting. In that model, 
each inspectee is under evolutionary pressure and periodically updates their strategies after binary 
interactions. Specifically, at the beginning of each period, an exogenous and fixed fraction of the 
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population can update their behaviour upon meeting another randomly chosen individual in the 
population. If two inspectees meet and have the same strategy, then that strategy is retained by 
the updating individual. If however, the two individuals have different strategies, the updating 
individual may revise his behaviour on the basis of the payoffs enjoyed by the two in the previous 
period. Solutions in terms of mixed strategies to this type of games are presented therein. 

In this paper, we present a new development of inspection games in a mean field setting. In 
this new model of inspection games, there is one inspector and a large number of interacting 
inspectees. Different from the settings by Kilgour and Avenhaus (1994), Avenhaus and Kilgour 
(2004) and Gianini et al. (2013) where the multiple inspectees are independent, our new model 
considers interacting inspectees in the sense that one inspectee’s payoff does depend on one another’s 
strategy. Also different from the evolutionary setting by Kolokoltsov, Passi and Yang (2013) where 
the binary interactions are considered and inspectees have no payoff functions, in our new model 
each inspectee aims to maximise her own total expected payoff which depends on the aggregate 
behaviour of the population. We aim to approximate the Nash solution to this new type of dynamic 
inspection games by using the recently developed theory of mean field games. 

The mean held game theory is a new branch of game theory and it has become a powerful 
tool to study complex games with a large number of players. The initial work done by Lasry and 
Lions (2006a, 2006b, 2007) and Huang, Malhame and Caines (2006, 2007) consider continuous¬ 
time continuous-state games. In this paper in the context of inspection games, we study a mean 
held game in a finite state space setting. The finite state space setting is also considered within the 
context of socio-economic sciences by Gomes, Mohr and Souza (2010, 2013), Gomes, Velho, Wolfram 
(2014a, 2014b), wherein the authors study one hyperbolic equation and focus on the analysis of 
shock-formation phenomenon of the system in the setting of two-state mean held games. Though 
those above mentioned papers consider a similar framework as the one in this paper, different 
approaches are applied. In this paper, we apply a modified version of the standard mean held 
games method, namely we study a system of coupled two differential equations with one being 
forward and the other one backward. Our model contributes to the study of mean held games on 
a hnite state space with a major player. 

The paper is organised as follows. Section 2 describes in detail the model of an inspection 
game with one inspector and a large number N inspectees. In Section 3, we set up a mean held 
inspection game with a continuum of inspectees and derive the system of coupled equations (13.121) - 
(|3.13l) . Main results of this paper are shown in Section 4. First, in Theorem 14.11 we show that for 
a short time game, there exists a unique solution to the mean held inspection game, namely, the 
single inspector has a unique best response to the continuum of inspectees and any representative 
inspectee has a unique best response to the inspector and the aggregate behaviour of the continuum; 
whereas, for a long time game, we show the existence of a solution to the mean held inspection 
game. Then in Theorem 14.21 we show that the probability distributions of hnite IV-inspectees on 
their state space converges to the one of a continuum limit as N —> oo. Finally, in Theorem 14.31 
we conclude that an optimal Lipschitz continuous switching strategy, which is derived from the 
mean held inspection game, is an e-equilibrium to an inspection game with a hnite number N of 
inspectees, with e = e(N) —> 0 as N —> oo. In Section 5 we conduct an approximation analysis in 
order to adapt our theoretical results for numerical analysis and to discuss the rate of convergence. 
We approximate optimal Lipschitz continuous switching strategies q* by a sequence of smooth 
switching strategies q* , rj > 0. We show that any approximating smooth switching strategies q* is 
also an e-equilibrium solution to the hnite N inspection game, with e = e(N, rf) —> 0 as N —> oo 
and 77 —> 0. Further, using smooth switching strategies q* as an e-equilibrium solution, we show 
that e = e(N, rf) is of order 1/N. 
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2 An inspection game with N interacting inspectees 


We consider a dynamic inspection game in a continuous time setting with a finite time horizon 
T > 0. In this game, there is one inspector and N (a fixed integer) inspectees (refereed to as 
the population). Roughly speaking, every inspectee chooses her crime levels they would commit to 
maximise her payoff function. The controlled dynamics of crime levels for each inspectee is modelled 
as a controlled continuous-time Markov Chain. The inspector decides the amount of investment 
for inspection so as to maximise her payoff, based on the observation of the crime distribution. 
To study this game, normally one looks for a profile of best responses for all inspectees and the 
inspector. 

Formally, first we discuss the N inspectees. Let h d = {h, ■ ■ ■, l d }, d G N, be the state space of 
any inspectee. States k G L^, i = 1,..., d, are interpreted as crime levels; in other words, U can be 
understood as illegal profits one can gain by committing crimes. 

Denote by E^ the set of probability distributions on L^, i.e. 

d 

= {x = (xi,... , x d ) G [0, l] d : ^2 x i = !} (2.1) 

3 = 1 

and by C([0, T], E^) the set of continuous curves {x(t) G E d,t G [0,T]}, equipped with the norm 

NOIloo = sup \\x(t)\\ ( 2 . 2 ) 

te[o,T] 

where || ■ || denotes the Euclidean norm in RA 

The dynamics of every inspectee is modelled by a continuous-time Markov chain on L^. Every 
inspectee chooses a switching strategy between the crime levels to maximise her own objective 
function. Specifically, the dynamics of the crime levels of the inspectee aG{l,...,IV}is modelled 
by a continuous-time Markov chain M A = {M^ a \t),t G [0,T]} on the state space L^. For a given 
curve {x(t),t G [0, T]} G C([0, T], E^), the stochastic dynamic is specified by the switching 

matrix 



( 

q a {t,h,x(t)) 

\ 


( Qii(t,x(t)) . 


Q a {t,x(t)) = 


q a {t,k,x{t)) 


= 

(*>*(*)) ■ 




q a {t,l d ,x{t)) 

) 


V 9di(*.»(*)) • 

• g*i(t.z(t)) ) 


which is chosen by the ath inspectee. At any time t G [0, T\, iGE d and j A L the entry qfj(t,x) 
is in [0,(5] and presents the infinitesimal transition rate from state Zj to state lj, bounded by a 
constant Q > 0. Moreover for any i, J2j=i Qij(ti x ) = A namely q^{t,x) is chosen in such a way 
that q*(t, x) = - x). 

We are interested in symmetric inspectees, meaning that any inspectees who are at the same 
time at the same crime level will choose the same switching rate. In other words, the choice of a 
switching strategy does not depend on the identity of each inspetee. Thus we can omit the identity 
index and for any ( t,U,x ) G [0, T] x x E^ denote 

q(t,k,x ) := q( a \t,li,x ) and q%j{t,x) = qfj(t,x). 

Next we discuss the dynamics of the population which is denoted by 

A 7V = {(A 1 7V (t),...,X ( f(t)),tG[0,r]}. 
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The dynamics X N is a Markov process and describes the evolution of crime distributions, that is 
for any i = 1,..., d 

Xi ( ‘>- N - 

specifies the proportion of inspectees at the crime level /, at time t £ [0, T}. The superscript N in 
X N is used to distinguish between the dynamics in the finite population of size N and the one in 
the limit (to be introduced in Section 3). The state space of the population is denoted by 


§ 


N 

d 





which is a subset of the closed simplex defined in (12.111 . 

The Markov process X N on the state space is generated by the time-inhomogenous operator 
L? : C(S^) -> C(S^) defined by 


L t f(y ) = ( y ‘ e i) ( nA t ->y) N 


i,j =1 

*7 M 





(2.4) 


where ej, i £ {1,... , d}, denotes the standard basis in R d , namely the zth entry is 1 and all the 
other entries are 0; (y ■ e*) = rii/N for any y = ( m/N ,..., rid/N). 

An intuitive probabilistic interpretation of the stochastic process X N = {(Xf^(i),..., X^ (t)),t £ 
[0,T]} is as follows. At the initial stage of this game t = 0, the N inspectees are distributed arbi¬ 
trarily among the d states {h, ■ ■ • ,ld}- The initial state of the population is described by the vector 
A^(0) = (ni(0)/lV,... ,rid(0)/N). Here rij( 0), j £ {1,... ,d}, specifies the number of inspectees 
at the crime level lj at t = 0. As the dynamic of each inspectee is modelled as a Markov chain 
with a switching matrix in the form of (12.31) . every inspectee has a random waiting time at their 
current crime level before she switches to another crime level. Denote by t\ the shortest waiting 
time among N inspectees. Then at the time ti, it is the first time when an inspectee changes her 
crime level, say from l{ to lj. Consequently, from the initial state A JV (0), the Markov process X N 
obtains the new state: 


x N ( n ) 


I«(0)_A + 2 = 


N N 


( ni(0) nj{ 0) - 1 

V N N 


Tlj ( 0 ) + 1 

N 


nd( 0) \ 

N J 


Then the process X N evolves in the manner as described above from the new state X^(ri). 

It is important to note that the curve {x(t),t £ [0, T]} in (12.31) is the realisation of the 
crime distribution evolution {(A( v (t),... , X^(t)), t £ [0, T]} in the population of N inspectees 
or {(Ai(t),... ,Xd(t)),t £ [0, T]} in the continuum of inspectees (see in the following Eq. (13.1311 ). 
This is the exact place to see how inspectees interact with each other: the state dynamics of ath in- 
sepctee M is influenced by all other inspectees strategies through the crime distribution; in other 
words, any inspectee’s state dynamics is influenced by the aggregated behavior of all inspectees. 
This is what we mean by the mean field interaction setting. 

In this model, we do not require that every inspectee has perfect information on the crime levels 
of all other inspectees, but we assume that everyone has the access to the exact information about 
the aggregate behaviour of the population, namely the crime distribution. 

Next, we introduce the single inspector. The inspector has limited available resources for 
inspection, denoted by F > 0. At any time t £ [0, T], she needs to decide an amount of resource 
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a(t) £ [0, .F] to be invested in inspection in order to maximise her total expected payoff function. 
It is assumed that the inspection resource is uniformly distributed among the population and the 
inspector will charge a fine alj, a > 0, if she uncovers a crime at the level lj £ L^. 

Due to the limited resources, a complete surveillance of all inspectees’ actions is practically not 
possible. Therefore, inspection takes place in form of a randomisation. We introduce a detection 
function, which relates the detection probability to inspection resources and is given by P : [0, F] —> 
[0,1]. It is assumed that the detection function P is nondecreasing and concave in inspection 
resources a , i.e. 

P'(a ) > 0 and P"(a) < 0, for a £ [0, F]. (2-5) 

One way to understand the value P(a) for an a £ [0, F] is that, every inspectee is inspected and 
an illegal behaviour can be uncovered with the probability P(a). Another interpretation of P(a) 
is that, with the inspection resources a invested, a proportion P(a) of the population is inspected 
with perfect inspection, in other words, every inspectee has a probability P{oi) to be inspected and 
an illegal behaviour will be detected with probability 1. 

Now, we are ready to introduce the payoff functions for inspectees and the single inspector. At 

each time s £ [0, T] with a crime distribution X N (s) of the population, if the inspector invests a(s) 

for inspection, the ath inspectee with her crime level (s) faces a probability P(a(s )) with which 

her illegal behaviour will be detected and the inspectee has to pay a fine aM^\s)‘, on the other 

hand, she escapes with a probability 1 — P(a(s)) and gains an illegal profit M^\s). Moreover, the 

inspectees pays a cost for the change of strategies, which is quadratic in the transition rates i.e., 

V \ Q 2 (a) (s, X N (s)). Therefore, at each time s with a crime distribution X N (s) of the 

hr M N ( s > M^'(s)lj 

population, the ath inspectee with a crime level M(s) has a running payoff 

(1-P(a(s)))M^ } (s) - P(a(s))aM^ } (s) - ^ q^^^s, X N (s)) 


and a terminal payoff X n (T)), where Jy : Lj x -> R. Therefore, for a given 

X N £ C([0, T], Tid), the ath inspectee aims to mamixise her payoff function 


jW(t,U, qi -,X N ) = E lt 


(1 - P{a(s)))M^\s) - P(a(s))aM^ (s) 


E 




q M^\s) l , xN (*))] ds + J t( M n C T),X n (T )) 


( 2 . 6 ) 


over switching strategies g, = {(qn(t),... ,qid(t)),t £ [0, T]} with qij{t) £ [0, Q\ for any t £ [0,T] 
and j = 1,... , d, Y7j=\ Qij(t) = and Qu(t) is such that q tl (t) = - 

In the meanwhile, the inspector can observe the crime distribution X N (s ) of the population. 
At any time s £ [0, T], the inspector pays an amount of investment resources for inspection a(s) 
and gets a payoff from any individual inspectee at the crime level If. 

(s) := V N (P(a(s))ak - (1 - P(a(s)))k ), (2.7) 

where rj N > 0 is given and prescribes the weight that the inspector assigns to any single inspectee in 
the finite N population. The inspector aims to maximise her expected payoff at each time instance 
s £ [0, T], that is she wants to maximise 

E ^-a(«) + XJ NXfis)*?^ (2.8) 
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over a(s) £ [0, F], Plug (12.71) into (12.81) and define L := Nr] N . Then the inspector aims to maximise 
her payoff function 


C/ JV (a(s),X iV ( S ))=E 



d 

(s) + L'£x»(s)(p(a(s))al i 

i —1 



(2.9) 


over her inspection investment a(s). 

By differentiating the function Un with respect to the first variable and together with Condition 
(ESI), the function Un in Eq. (12.91) has a unique maximiser a* N : E^ —> [0,F]: 


a* N (s ) = a* N (X N (s)) : = arg max Un((x, X n (s)) 

ae[0,F] 


= mm 


{F 


,/\-i 


L 


(l + <t)E [zUliXfis) 


( 2 . 10 ) 


where (P 7 )” 1 denotes the inverse function of P'. 

Note that the index N is used in the notations of objects in the setting of finite N inspectees 
to distinguish them from their counterparties in the mean field inspection game, see Section 3. 

Remark 2.1. The parameter i] N is small, compared to the large number N of inspectees. The 
parameter L = Np N in ra can be interpreted as the approximate total fine from the whole 
population of the inspectees. In Section 3, we will consider a limiting model by sending the number 
of inspectees to infinity, i.e. N ^ oo. In the limiting model, the value of the parameter L is kept 
the same as in the finite population problem and rj N —» 0 as N —> oo. In other words, in the 
inspector's viewpoint, when the number of inspectees becomes very large, her total expected payoff 
is always bounded, and any individual inspectee’s contribution in E2D to the inspector’s payoff 
becomes negligible. 


We are interested in finding Nash equilibria of this type of inspection games. This means to 
find the best-response investment strategy a* N (t) at any t £ [0,T] and a family ... , 

where Q*( a \a £ {1,... ,1V}, denotes the best switching strategy of the ath inspectee as the best 
response. When N is very large, the complexity of this problem gets immense. 

In this paper, we will apply the mean field games methodology to solve this type of games and 
provide an e-equiliblium. Roughly speaking, we will take the number N of inspectees to infinity 
and set up the (limiting) mean field model with a continuum of inspectees. We call this game with 
a single inspector and a continuum of inspectees a mean field inspection game. Then we prove that 
any solution to the mean field inspection game presents an e-equilibrium to the original model with 
N inspectees. 


3 The mean field inspection game 


In this section, we study a mean field inspection game with a continuum of inspectees and one 
inspector. First, let’s discuss the dynamics of the continuum limit as N —> oo. The state space of 
the continuum is naturally specified by in (12.11) . Observe that, for / £ C ll (E^), 


lim N 

TV—>• oo 
y^rx 


Ay - | + f) - Ay) 


df , \ df ( , 

x) - 


dx 


dxi 
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so that the limiting generator A t : C 1 (Srf) — > C(Srf) of L J in (12.41) as N —> oo is of the form 


A t f(x) 


Jim L?f(y) 

Af->oo 


!>«<*'*> “ ^ (l) ) 

*74? 

d qj, 

E ( x iqij(t, X ) - s)) 7^-(z). 

J=1 J 


(3.1) 


The limiting operator ^4^ in (13.11) generates a controlled crime distribution evolution of the contin¬ 
uum of inspectees on the state space £<$> which is denoted by X = {X(t) = (Xi(t),... ,Xd(t)) : 
t € [0,T]}, where Xi(t) describes the fraction of inspectees at the crime level Zj. Then the limiting 
distribution evolution X is governed by the kinetic equation 


i = i,... ,d. (3.2) 

3 = 1 

It is worth noting that in contrast to X N generated by L ^ in (12.41) . the limiting evolution X is 
deterministic, since it is modelled as the solution to the ordinary differential equation (|3.2D . 

Now in the mean field inspection game, at any time t £ [0,T] with a crime distribution X(t) = 
(Xi(t ),..., Xd(t)), the inspector aims to maximise her payoff U(a(t),X(t )) over her inspection 
investment a(t), where 


d 

U : [0, F] x Srf —» R, U(a, x) := — a + L((l + cr)P(a) — l) Uxi. (3-3) 

i —1 

By the condition (12.51) . for any X(t) £ the function U in (13.31) has a unique maximiser a*[t) 
given by: 


a*{t) = a*(X(t)) 


arg max U(a,X(t)) 
a£[0,F] 


min < {P 


j \-1 


i(i + ^)E-=i^(t), 


(3.4) 


where ( P ') _1 denotes the inverse function of P'. 

Let {M(t),t € [0, T]} denote the controlled state dynamics of a representative inspectee of the 
continuum of inspectees. Starting at any time t £ [0, T] and a state U £ L^, a representative 
inspectee of the continuum aims to maximise the payoff 




J (l-P(a*(X(s)))M(s) - P(a*(X(s)))aM(s) 

- E ^M{s)l^ X ^))]ds + J T{ M iT),X{T)) 

h^M(s) 


(3.5) 








where the function a* is defined in (13.41) and the terminal cost function Jt : IL^ x —> R. Denote 
the running cost function in the payoff (|3.5I) by h : x x R c/ —> R 


h(k,x,qi) = (1 - P(a*(x)))k - P{a*{x))ak - ^ qf 7 - (3.6) 

i¥=i 

= k- li(l + a)P(a*(x)) qfj 

where qi = (qn, ■ ■ ■ ,qid)- The value function for a representative inspectee V : [0, T] x x 
C([0, T], Erf) —> R is defined as 


V(t,U-,X)= supEt,,. / h(M(s),X(s),q(s,M(s),X(s)))ds + J t (M(T), X(T)) (3.7) 


9f(v.O •R 

over measurable functions q : [ t,T] x Lj x Sj -) R rf . For any t £ [0, T] and X £ C([0, T], E^), 
denote the norm of a value function V on by 


\\V(t,-,X)\\ := sup \V(t,li-X)\. 
h 6L d 


(3.8) 


By the dynamic programming principle, for any given distribution evolution X £ C([0, T], E^), 
the value function V in (13.71) satisfies Hamilton-Jacobi-Bellman (HJB) equation 

^(t,U) + H(U,V(t,-),X(t))= 0 (3.9) 

with a terminal function V(T, •) = Jt{-,X(T)), where the function H : x R d xE,j->R 


H(li,<j),x ) 


sup k - k(l + a)P{a*{x)) - ^2 Qij + 22 ^ ~ <&)9y 
9ieRd j^i j& 


(3.10) 


with the function a* defined in (13.41) . where cj) = (</>i,..., 4>d) and qi = {q t \ ,..., q t d) £ R d . The first 
order condition shows that the unique maximiser in (13.101) is 


q*(t,k,x-(j)) = (?*i(i,x; <£),..., q* d (i, a;; 0)) 


(3.11) 


where for j / i 


{ 0 if 4>j — 4>i < 0 

\{4>3 ~ <t>i) if 0 < fa - 4>i < 2Q 
Q if 4>j - 4>i> 2Q 

and by definition q* (t , x\ <j>) = - Q*j (t, x;cj)). 

Notice that the individual optimal switching function q* in (13.111) does not explicitly depend 
on the aggregate behaviour of the population X. However, in the model, the variable 4> will be the 
value of the value function in (13.71) at each time instance, which depends on X. 

To summarise, the function a* in (13.41) gives the best response for any inspector to the prevailing 
crime distribution X(t) £ E^ at any time t £ [0, T], The running cost function h in (13.61) for a 
representative inspectee depends on the best response function of the inspector a*. The value 
function V in (13.71) for the representative inspectee is the solution of the HJB equation (13.91) . The 
resulting best response of a representative inspectee is the optimal switching function q* in (13.111) . 
Rationally, the representative inspectee applies the resulting optimal control policy q* to control 
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her state dynamics of crime levels. The resulting crime distribution evolution, as the aggregate 
behaviour of the continuum of inspectees, is described by the solution X = {X(t), [0, T]} to (13.21) 
with the optimal switching q* in (13.111) . The solution X should be consistent with the one observed 
by the inspector as the prevailing crime distribution. Hence we get the following system of coupled 
equations 


*¥M). + H(l i ,V(t,-),X(t))= 0, i = l,...,d 

V(T,.) = J t (;X(T)) 


, X(0) = X'(0) 

where the Hamiltonian H is defined in (13.101) and the optimal transition function q* is defined in 
(|3.11|) . The main feature of this coupled system of equations is that Eq. ()3.12|) is a backward 
ordinary differential equation, yet (13.131) is a forward ordinary differential equation. This model 
can be viewed as a modified version of standard mean field games equations in discrete-state space 
setting with a deterministic major player. 


4 Main results 

First, we discuss the existence and uniqueness of a solution to the coupled system (|3.12I) - (|3.13D . 
In Theorem 14.11 we show that for a short time game, there exists a unique solution to the mean 
field inspection game, namely, the single inspector has a unique best response to the continuum of 
inspectees and any representative inspectee has a unique best response to the inspector and the 
aggregate behaviour of the continuum; whereas, for a long time game, we show the existence of a 
solution to the mean field inspection game. Then in Theorem 14.21 we show that the probability 
distributions of finite IV-inspectees on their state space converges to the one of a limiting system 
as N —>• oo. Finally, in Theorem 14.31 we conclude that a solution to the mean field inspection game 
is an e-equilibrium to an inspection game with a finite number N of inspectees. 

Theorem 4.1. (i) For a small T, the coupled system of equations (13. 121) ~ (|3.131) has a unique 
solution ( X , V); 

(ii) for an arbitrary finite T, there exists a solution to the coupled system of equations (13.121) - 
(13T3D . 

Proof. The proof to Theorem 14.11 consists of three steps and can be found in Appendix [A] □ 

Next, we prove that the pair of resulting best responses a*(X(t)) and q*(t, li, X(t)] V(t, •; X)) 
presents an e-equilibrium of an inspection game with one inspector and N inspectees, where ( X , V) 
is a solution to equations (|3.12I) - (I3.13D and a* and q* are defined respectively in (|3.4I) and (|3.11D . 

To this end, we will tag one insepctee and impose that she applies a switching strategy q(t. k,x) 
which is Lipschitz continuous in the variable x and different from q*(t,li,x). Let M^ 9,q (t) (resp. 
M^ 9 ’ q (t)) denote the state dynamics of the tagged inspectee with switching strategy q (resp. with 
q*) in the finite N inspectees setting with a given initial data mAr(O) £ L^. Let M ta9,q {t) (resp. 
M tag - q (t)) denote the state dynamics of the tagged inspectee with switching strategy q (resp. with 
q*) in the continuum limit with a given initial data m(0) £ L^. Meanwhile all other inspectees 
apply the same strategy q*. 
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The controlled Markov process {(A^. ~^(t), (tf) ,t £ [0,T]} of N interacting inspectees is 

generated by the operators L^[q*,q] acting on [C {x L d)] d (the set of continuous and bounded 
vector-valued functions / on x L^): 

d 

Lt [q*,q]f(y,k) = Y^ y ' ei)q* ij {t,y)N [f(y — ft + ft,l k ) - f(y, l k )\ 

i,7=l 

i^k 

d 

+ Y^ y ■ e k - v) N [f(y ~ % + »4 ) - f(y, 4 )] 

3 = 1 
d 

+ Y y)N [f(y -7 $ + %,lj)~ f(y, 4 )] (4.1) 

3 = 1 


where y = ( n\/N ,..., rid/N). 

One can under the operators in (14.11) in the following way. Consider that the tagged inspectee is 
at the crime level l k £ and there are n k inspectees who are at the crime level Ik- The first term 
in (14.11) prescribes the interactions between inspectees who are any crime levels l t £ \ Ik, and 

apply the strategy q*\ the second term in (14.11) prescribes the interactions between inspectees who 
are at the crime level Ik, except the tagged inspectee, and apply the strategy q*\ the third term in 
((4.1(1 prescribes the behaviour of the tagged inspectee who are at Ik and applies the strategy q. 

Remark 4.1. In the definition of the operator L^[q*, qj in (14.11) . the position of the tagged inspectee 
is counted twice as his own position (crime level) M^ 9,q (t) and as his contribution towards the crime 
empirical measure X(f ig {t). Alternatively, instead of Xfi t -At) one can take the empirical measure 

of other inspectees only, i.e. A/^T 1 (t). Such notations are used e.g. by Gomes in [LfiJ . However, 
the distinction between these two notations disappear in the limit N —> oo. 

The controlled process {(Xf q *^(t),M ta9,9 (t)),t £ [0,T]} of the continuum limit as N — > oo is 
generated by the limiting operators A t [q*,q\ : [C' 1 (S ( i x Ld)] d —> [C(T,d x Lrf)] d of L^ in (14.11) : 

Mq*,q\f(x,l k ) ■■= lim f(y, l k ) 

iV—»oo 
y->x 

d r, p d 

= Y( Xiq ij( t ’ x ') ~ x 3^ji(^ x )) + “ f( x ,h)) (4.2) 

j =1 3 J =1 

iAj 

where the space [C 1 (S^ x L^)] d is the set of continuous and bounded vector-valued functions / on 
T,d x L d which are differentiable in the first variable. 

Remark 4.2. It is worth noting that the process {(A4* -, (t), M^ 9 ’^(t)), t £ [0, T]} is a Markov 
process only if this pair is considered as an entity. In other words, a single component, either 
{Xfa* q\(t)}t £ [0,T]} or {M^ 9,9 (t),t £ [0, T]}, is not a Markov process and cannot be discussed 
separately, since the distribution evolution of the N interacting inspectees {Xfi t -At),t £ [0, T]} is 

coupled with any individual’s dynamics {M 1 ^ 9,9 (t),t £ [0, T]}. 

In the continuum limit, since any single inspectee’s behaviour has negligible impact on the whole 
population’s statistical behaviour, the distribution dynamics {Ar q * Mt),t £ [0,T]} is still the solu¬ 
tion to the ordinary differential equation (13.131) . although one inspectee chooses a different strategy 
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from q*. Since {X[ q * ^(t),t £ [0, T]} is a deterministic process and not affected by any single in- 
spectee’s behavior {M tag ’ 9 (t),t £ [0, T]}, one can view {M ta9,q (t),t £ [0, T]} as a Markov process, 
parameterised by {X[ q * ^(t),t £ [0,T]}. 

We show that, as N — > oo, the Markov process {(X^, ^(t), , t £ [0,T]} generated by 

Lp r in (14. ip converges to the Markov process 

{(X[ q * q](t), M ta9 ’ 9 (t)),t £ [0,T]} generated by A t in (14.21) . This result is crucial for the final result 
which is stated in Theorem 14.31 

To this end, we need the concept of propagators. For a set of continuous function C(T,d,I^d), a 
family of mappings 'F t,r from C(T,d,I^d) to itself, parametrized by the pairs of numbers r < t (resp. 
t < r) from a given finite or infinite interval is called a (forward) propagator in S, if T 4,4 is the 
identity operator in C (Srf,L^) for all t and the following chain rule , or propagator equation , holds 
for r < s <t: 

xfrt,s.q,s,r = q,t,r 

Let ^°jf. tag [q*, q] denote the propagator generated by L^[q*,q] in (14.11) and ^^ g [q*,q] the prop¬ 
agator generated by A t [q*,q] in (14.21) . By saying this, we mean that for / € D(T^), the equations 


d T 

—T 


t,S 


ds N ’ ta9 


/ = * 


t,S 

N;tag 


Lsf, 


d 

ds 


s,r 
N\tag 


f = ~L 


N ^s r , 
s * N\tagJ i 


0 < t < S < r, 


hold a.s. in s and for / £ D(Aj), the equations 


—&’ s f = 
ds tagJ 


ta 9 A s f , ]f = ~A 


s*Zgf, 


0 < t < s < r, 


hold a.s. in s. 


Theorem 4.2. Suppose that as N oo, the initial data converges to certain xq £ 

and the initial data m]\r(0) £ L d converges to certain m{ 0) £ L^. Then for any switching function 
q which is Lipschitz continuous in x and any f £ [C ,1 (S ( i x L d)] d 


lim 

N —^OO 




o ,t 

N;tag 


[q*,q]f(xo,m N {0)) 


®t£g[q*,q}f( x o,m(o)) 


= 0 


(4.3) 


uniformly in t £ [0, T] with any T > 0. 

Proof. First, if the space [C <1 (Srf x Ld)] d is a core of the propagator generated by the 

limiting operator At [q *, q] defined in (14.21) , then we have the convergence of the generators on the 
core of the liming semigroup, i.e., for any / £ [C l (Yid x Ld)] d 

lim L?[q*,q\f(x,l k ) = A t [q*,q]f{x,l k ) 

N-t oo 


which implies the convergence of the semigroup on the core [C ,1 (S^ x L^)] rf , c.f. Kallenberg (2002), 
i.e. the statement (14.3 j) is proved. 

The result that [C ,1 (S^ x L^)] rf is a core of the propagator ^f g [q*,q] generated by the limiting 
operator A t [q*,q] defined in (|4.2I) is proved in Appendix iBl in 3 steps, see Appendix IB. 11 Appendix 

El 

Here we only need to check the conditions in Appendix IB. 31 are satisfied. By (13.111) and (IA.3D 
we have that the optimal switching function q*(t, h,x) is Lipschitz continuous in x. It is clear that 
from (|3.13p . the function F in (IB.71) with Fi(t,x ) = x '- V(t, ■)) is Lipschitz continuous 

in both t and x. Together with the condition that q is Lipschitz in the variable x, the conditions 
in Appendix IB. 31 are satisfied. The proof is completed. □ 
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Remark 4.3. This kind of convergence results of N-particle approximations have been proved e.g. 
by Kolokoltsov, Troeva and Yang (2014) for a diffusion process and by Kolokoltsov, Li and Yang 
(2012) for rather general Markov processes with smoothing property (excluding the present case). 

As a direct consequence of Theorem 14.11 and Theorem 14.21 , we have the result in the following 
theorem, stating that any solution derived from the limiting model (13. 12f) ~ (13.13D can be used to 
approximate the one for an N player game. 

Theorem 4.3. Suppose that 

(i) as N —> oo, the initial data x(( € converges to certain xq £ and the initial data 
7n/v(0) G L d converges to certain m(0 ) £ L^. 

(Hi) the terminal cost function Jt : x — > R is Lipschitz in both variables. 

Then a strategy profile 

{a*(X(t)),q*(t, ■, X(tf V(t, ■)), X(t); V(t, •))} 

with a* and q* defined respectively in (13.41) and (13.111) . and with (X,V) being a solution to the 
system (1 3.12 j )- (] 3.13 D . is an e-equilibrium in any N inspectee inspection game with e = e(N) —y 0 as 
N oo. 

Proof. First denote by (t) and M q (t) the state dynamics of an inspectee with the switching 
strategy q* in the finite N inspectees setting and in the continuum liming setting, respectively. 
Similarly, denote by X^Mt) and X[ q *](t) the state dynamics of the population with every inspectee 
applying the switching strategy q* in the finite N inspectees setting and in the continuum liming 
setting, respectively. In fact, X[ 9 *](i) and Xi q * q i(t) generated by the operator A t [q* ,q\ in (14.21) are 
the same object, namely the solution to Eq. (|3.13j) . see Remark 14.21 

To show that the strategy a*(X^if)) with X[ q *}(t) being a solution to (13.131) is an e-equilibrium 
for the inspector, we need to show that, for e = e(N) > 0, 

*[,.](*)) > UNiaUtyxg.fiV-e (4.4) 

where the payoff function Un defined in Eq (12.91) . and the inspector’s best response functions oc* N (t) 
and a*(t) are defined in Eq (|2.10|) and Eq (13.41) in the N inspectees setting and in the mean field 
inspection setting, respectively. 

By the definition of a.* N (t) and a*(t), we have that a* N (t) and a*(t) are Lipschitz continuous in 
X N (t) and X(t) respectively. Then by Theorem 14.21 we have 

lim |ct*(f) — off] (f) | = 0. 

7V->oo 

That is, for N big enough, we have 

a*(t) = a* N (t ) ± e 

with e —> 0 as N —> oo. Therefore for N big enough, again by Theorem 14.21 

U N ( a^tyx^t)) - UNiaUtyX^ft)) 

( d 

=E -a*(ft) + L^X^it)^(a*(t))ah - (1 - P(a*(t)))h ) 

V i=i 

/ d 

- E -a)v(t) + L E X klY] (*) - (1 - P(a* N (t)))li ) 

V i =i 

> - e, 
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with e —> 0 as N —> oo, where the continuity of P in a is used. 

Next, to show that the strategy q* is an e-equilibrium for an individual inspectee, we need to 
show, for any inspectee a = 1 ,,N and e = e(N ) > 0 

> J {a \t,h,q-,X^ S ) - e (4.5) 

for any q where the payoff function is defined in ( 12 . 61 ) . Since the payoff function J defined 
in ( 12 . 61 ) is Lipschitz in X N , the appendix [0 implies Eq. ( 14 . 51 ) . The proof is completed. □ 


5 Approximation analysis 

In this section, we will approximate the optimal switching strategy q* defined in (13.111) by a sequence 
of smooth function q*, for q > 0. We prove that any smooth approximation q* as a solution to 
a forward-backward model is also e -Nash to any inspection game with finite-number inspectees. 
This approximation analysis is motived by the following two considerations. 

Firstly, based on the result that there exists a solution, a consistent pair (X(t),V(t,-)), to 
the system of equations (13.12M3.13I) . one can obtain an optimal investment strategy a*(X (t)) by 
(13.4|) and an optimal switching strategy q*(t,li,X(t)\V(t,-)) by (13.111) . Since there are no analytic 
formulae for computing X and V, one needs to find numerical solutions of X and V. In this paper, 
we do not attempt to investigate methods for abstaining numerical solutions but we intend to adapt 
our results proved in the previous sections for numerical analysis. Recall that the obtained optimal 
switching strategy q* defined in (|3. 11 1) is a Lipschitz function in x. However, for numerical analysis, 
very often the smoothness of the function q* is needed. The results in Theorem 15.11 makes our 
theoretical results in section 4 applicable to numerical analysis. 

Secondly, considering smooth approximations q* as e -Nash equilibria to the inspection game 
with finite-number inspectees enables us to discuss the error bound of the approximation as N —>• oo. 

Now by standard procedure we construct a sequence of matrix-valued smooth functions q* : 
[0, T] x Lrf x E^ —> R J to approximate the continuous function q*. We define for q > 0 

q*(t,li,x):= q*(t,k,x - y)cj> v (y)dy (5.1) 

•'Erf 

where the function cj) v is a smooth modifier. We have that for any t £ [0, T] and h £ L^, q* converges 
to q* uniformly on E^, i.e. 


lim sup \q*(t,li,x) - q*(t,k,x)\ = 0. 


_ y_ 

A typical example of the molllifier ^ can be 4> v (y) = ^o wv e 2v ■ 

Theorem 5.1. Suppose that as N — >• oo, the initial data x^ € converges to certain xo £ E^ and 
the initial data mjv(0) € L d converges to certain m( 0) £ L^. Moreover, the terminal cost function 
Jt : x Yj c j R is Lipschitz in both variables. Then 

(i) any q* defined in (15.11) is an e-Nash for a finite game with e = e(q, N ) -» 0 as N —> oo and 
q —> 0. 

(ii) if q* defined in (ED is two continuously differentiable in x, e is of order 1/N. 

Proof, (i) To prove q* is an e-Nash, we aim to prove that for (f, If) £ [0, T] x and for any other q 


J^(t,h, q* v ,X^) > jW(t,k,q,X") 


— e. 


(5.2) 
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Take the approximating smooth optimal control q*(t, k,x) = (q* a (t, X v (t)),... >^(*>^(0)) and 
consider the system 


= E U x vAtKji^ x vW)> i =• • • - d 

X(0) = ®(0) 


(5.3) 


Let X q * be the solution to the system (15.31) . We have that X q * converges to X q * as 77 —>• 0. By 
Theorem 14.21 as N —>• oo, Xfi — > X a * and Xfl —> X q *. Therefore we have XfL —> Xfi as N —> oo 

and 77 —> 0. Since the payoff function J' a ' defined in (12.61) is continuous and Lipschitz continuous 
in we have for small enough rj > 0 and big enough N 


J (a) A k, q*X») = (t, k, q*X») + (t, k, q*X») - J « (t, k, ql,X%) 


= A a Ht,li,q*,X^)±e( V ,N,q*) 


>A a \t,U,q,X^)-e( V ,N,q,q*) 


where the result in the appendix ICl is used. Hence (15.21) is proved. 

(ii) To prove e is of order 1/N for a twice continuously differentiable q*, we aim to show that 
for / € [C 2 (S d x L d )] d , 


> m Jv(°)) -®tag[q*ri’q}f( x o,m{0)) < C(T) j- 1| / 1| [C 2 (SdxLd)] d. (5.4) 


„iV 


,o,t 


iV 1 


Here \L^ to9 [g*,g] denote the propagator generated by L^[q*,q\ and ^^ g [q*,q\ the propagator 

generated by A t [q*,q\. The space [C 2 [Y,d x L^)] d is the set of continuous and bounded vector¬ 
valued functions / on Sj x which are twice continuously differentiable in the first variable. 

By (14.11) and (14.21) , we write 


and 


d 

L^[q*, q]f(y, h) = Yl(y v) N [fiv ~N + N’ l k) ~ /(*/> l k)\ 

i,j =1 
i^k 

d 

+ ^{y ■ e k - jf)q q , kj (t, y)N [f(y -77 + 7 $,lk)~ fiv, k)] 

3 = 1 
d 

+ [f(y ~W + x3j) - fiv, h)] (5.5) 

3 = 1 


[9^ ?]/(*. ifc) - x jQ* v ,jA,x)) §~AJk) 

j =1 3 

*74? 

d 

+ '52Qkj(t,x){fi x ,lj) - fix, Ik)). (5.6) 

i=i 
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In fact, since q* is twice continuously differentiable in x, the space [C 2 (Ti d x L^)] rf is a invariant 
core for L^ [q *, q] and A t [q *, q ], that is 

L?[q;,q\ : [C 2 (S d x h d )} d [C 2 (E d x L d )] d 


and 

A[g*,g~] : [C 2 (£ d x L d )] d [C 2 (£ d x h d )] d . 

Further, by Taylor theorem, (15.5p can be expended by using the following representation (ref. 
Kolokoltsov (2010), Corollary F.2) 


f(y - Ci h) - f(y, h) = [ <>l + J q ds (* _ s ) 

Therefore, for / € [C 2 (T. d x Lrf)] d , we have 


d 2 f(y,h ) 

ac 2 


\\{L t ~ At[q*,q\)f\\[c2(?, d xL d )] d < C{T) — ||/||[c 2 (s d xL ti )] d - 

To complete the proof, we need the following calculation: for s <t 




S,t 

N;tag 


f - f = Ttf [I* f = 

J ^taqJ ^ N: taa^taq LI r=sJ 


L 


t r i , 

— (Ttf $1’* 

( J r \ N-.tag tag 


f dr 


L 


= T 


N\tag 


Lf - At ) *&/cir. 


r.t 


tagJ 


(5.7) 


(5.8) 


(5.9) 


By (|5.8D and (15.91) together, we get the required statement (15.41) . Consequently, by the definition 
of jW in (12.61) we have 


sup 

t 


<C(T)-\\f\\ [c ^ dXhd)]d 


□ 


6 Appendix 

A Proof to Theorem 14.11 

The proof to Theorem 14.11 consists of three steps. 

Step 1: for any given X £ C([0, T], T, d ), we show that the HJB equation (J3.12I) is well posed. 
Moreover, the resulting solution, denoted by V(t, li',X), is Lipschitz with respect to the parameter 
X. 

For proving the existence of a solution to the ordinary differential equation (13.121) , it is sufficient 
to have that the function H defined in (13.101) is Lipschitz in (j> uniformly. 

Since the optimal switching function q*j in (13. lip is Lipschitz continuous in (f>, we conclude that 
there exists a constant c > 0 such that for any Zj £ L<j, x £ T, d , 

x) - H(li,i/),x)\ < c\\(j) — V’lh € R d . (A.l) 

Therefore, for any X £ C([0, T], T* d ), there exists a unique solution to (13. 12[) . 
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To show that the solution V is Lipshitz with respect to the parameter X, we write the equation 
(13.911 in integral form: 

V(t, lf,X) + J H(U, V(s, •; X),X(s))ds = 0. 

The function H dehned in (13.101) is Lipschitz in x uniformly, since for any Zj € Lrf, (j> E R d and any 
x,y e E rf , 


I - H{k,(f>,y) | < k(l + a) |-P(a*(x)) - P(a*(y)) | < c\\x - r]\\ T v (A.2) 

with a constant c > 0. For any t E [0, T], € Lrf and X, Y E C([0, T], Erf) 

\V(t,l i -X)-V(t,l i ;Y)\ 

< J \H(li , •; X), X(s)) - H(k, V(s, •; E), F(s))|ds 

<f \H(k,V(s,-,X),X(s)) - H(i,V(s,-,X),Y(s))\ds 
+ \H(U, V(s, -,X),Y(s)) - H(U, V(s, ■; Y),Y(s))\ds 

< cT\\X - EIU + cf || V(s, -;X) - V(s, -,Y)\\ds 

where (I A. 1 1) and (1A.2I) are used to get the last inequality. Then by Gronwall’s inequality, the 
solution to (I3.12|) is Lipschitz continuous in X, i.e. there exists a constant c > 0 such that for any 
t E [0,T], 


\\V(t,-,X)-V(t r -Y)\\<cT\\X-Y\\ 00 . 

Consequenctly, by the definition of q* in (13.lip , we have that for any t E [0,T] and i, j = 1 ,,d 
with i / j, 

I Qijit, X(ty, V(t, •; X)) - Y(ty V(t, •; Y))\ < cT ||X - Y 
Step 2: we prove the existence of the solution to the following equation 

f <-i.<* 

1 X(0) = x(0) 

with a given switching policy q which is Lipschitz in x and then prove the sensitivity of the solution 
X(-) with respect to those q which are Lipschitz in x. 

Define a vector field G : [0, T] x Erf x R —> R^ with its zth component Gi : [0, T] x Erf x R —» R 

d 

Gi(t , x, q) := ^2 , x). (A.5) 

3 = 1 

To prove the existence of a solution to (13.131) it is sufficient to prove that G is Lipschitz continuous 
in x. We say G is Lipschitz continuous in x if each component Gi, i = 1 ,d, is Lipschitz in 


(A.3) 

(A.4) 
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x. Now we show that Gj is Lipschitz in x by using Schwarz inequality. Since q is assumed to be 
Lipschitz in x, we have for t £ [0, T] and x,y £ £<f, 

d d 

I Gi{t,x,q) - Gi(t,y,q)\ < ~ Qjifav)) I + I ~ y)\ 

i= 1 i =1 

<c||x-y||+Q||x-y|| (A.6) 


which implies that Gi is Lipschitz in the second variable x. Hence, for any given q which is Lipschitz 
in x , there exists a solution to (I A. 41) . 

To prove the sensitivity of the solution to (IA.4D with respect to those q which are Lipschitz in 
x, we rewrite (IA.4I) in a integral form. Let X ( resp. Y ) be the solution to (IA.4D under the control 
policy q\ (resp. q 2 ), with the same initial value xq £ namely 


and 


Xi(t) = x 0 ,i+ [ Gi(s,X(s),qi(s,li,X(s)))ds 
Jo 

Yi(t) = x 0 ,i+ f Gi(s,Y(s),q 2 (s,li,Y(s)))ds. 
Jo 


Then by (1A.6[) . we have 

\Xi(t) - Yiit )| < [ \Gi(s,X(s),qi(s,li,X(s)))ds - Gi(s,Y(s),q 2 (sJi,Y(s)))\ds 
Jo 


< 


[ | Gi(s,X(s),qi(s,li,X{s))) - G(s,X(s),q 2 (s,li,X(s)))\ds 
Jo 


rt 


< 


+ / \Gi(s,X(s),q 2 {s,li,X(s))) - Gi(s,Y(s),q 2 (s,li,Y(s)))\ds 
Jo 

[ c\q 1 (s,l i ,X(s)) - q 2 (s,li,X(s))\ds + f c||A(s) - Y(s)\\ T vds. 
Jo Jo 


By Gronwall’s inequality, we get for any t £ [0, T] 

d 

\\X(t)-Y(t)\\ TV =X)I x i(t)~Yi(t)\ < cT sup \\q 1 (;l i ,X(s))-q 2 (;l i ,X(s) 

i—l 

with a constant c > 0. 

Step 3: In summary, so far we have considered the following mapping 


(A.7) 


X —>■ g* —» X 

F: C([0,T],S d ) => C([0,T],S d ). (A.8) 

By analysing (|3.12l) we get (IA.3I) . namely the resulting optimal switching function q*(t,li,x ) is 
Lipschitz with respect to x\ further, by analysing (|3.13l) with any switching policy q which is 
Lipschitz in x. we get (1A.7I) . namely the solution to (13.131) is Lipschitz with respect to its control 
parameter q. Therefore we can conclude that the mapping T (IA.8I) is Lipschitz, that is, for any 
X, Y £ C'([0, T], Srf), there exists a constant c > 0 such that 

IIT(AT) - rpOHoo = \\X - Y\\oo < cT\\X - yiloo (A.9) 
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where the norm || ■ Hoc is defined in (12.211 . 

Thus for a small T, the mapping T is a contraction, proving statement (i). For arbitrary finite 
T, one has, from (13.21) . that the image of the mapping T is bounded equicontinuous and hence 
a compact subset of (7([0, T], (by Arzela-Ascoli theorem). Hence by the Brouwer fixed point 
theorem, T has a fixed point, proving statement (ii). 


B [C l {Y>d x hd)} d is a core for the generator A t 

This appendix aims to prove step by step that the space [C^E^ x L d)] d is a core for the limiting 
generator At defined in (14.21) . In IB. 11 we consider a single deterministic system X(t),t £ [0, T] and 
show that [C' 1 (Srf)] rf is a core for the generator of the system. Then in IB.21 we consider a time- 
homogenous Markov chain M(t),t £ [0,T] modulated by a deterministic system X(t),t £ [0,T]. 
We show that [C 1 (Erf x L d)] d is a core for the generator of the system (M(t) , X (f)), t £ [0, T], 
Finally in IB.31 we consider a time- nonhomogenous Markov chain M(t),t £ [0,T] modulated by 
a deterministic system X(t),t £ [0, T], This Fellow process (M(t), X(t)), t £ [0, T] is exactly the 
one generated by the limiting operator A t defined in (14.21) . We show that [C ll (Srf x 1^)]^ is indeed 
a core for the generator of the system X(t)),t £ [0, T], namely the generator At defined in 

( 021 ). 

Recall that the space [C' 1 (SrfXLrf)] d is the set of continuous and bounded vector-valued functions 
/ on Yid x L (i which are differentiable in the first variable. The standard notation ’ denotes the 
differentiation with respect to time, e.g. x = %. 


B.l Evolution of the deterministic dynamics X(t) 

Consider a system X(t),t £ [0, T] which is described by the first-order ordinary differential equation 

X t = F(X t ) (B.l) 

with a given initial xq £ E^ and F : E^ — >• R r/ . The system X(t),t £ [0,T] has the generator 
A : [C 1 (Ed)] d —> [C(Erf)] d which is of the form 

Af(x) = F(x)^(x). (B.2) 

Let (j>t denote the semigroup generated by the generator A in (IB.2I) . The solution of Eq. (IB. II) is 


f(X t (x 0 )) = ((j)tf)(x 0 ). 

Lemma B.l. If the function F in (IB.ID is Lipschitz, then [C 1 (E^)] rf is a core of the generator A 
in (IB.2j) . 

Proof. Since the space [C' 1 (Erf)] rf is not invariant under the operator A (|B.2|) . we cannot apply the 
standard result (c.f. Kallenberg (2002)) that a dense invariant subset of the domain is always a 
core. We introduce a subspace by collecting all shifted functions from [C' 1 (Erf)] rf and their linear 
combinations and denote this space by [(^(E^)]^ 

C( E d ) := {fi{X t (x 0 ))\Vt £ [0£ C\ S d )} 

with i £ {1,... ,d}. By its construction, this space [(7(Erf)] rf is an invariant core for the semigroup 
(ft. In order to prove that [C7 1 (E^)] rf is still a core for the semigroup eft, we construct a sequence 
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of vector-valued smooth functions F n and for each n £ N, the entry Ff 1 £ C' 1 (Srf),i = 1 ,d is 
defined as 

F P(x) := f Fi(x - y)<i> n {y)dy 

where the mollifier <f n is at least hrst order differentiable and with compact support. So we have, 
as n —> oo, Ff 1 £ C' 1 (S ( i) converges to Fij £ C(T, d ) as n —> oo. Then we consider the following 
approximating systems 

X t = F n (X t ) 

with an initial xq £ E^. Let X”(xo) denote the solution to this approximating systems. Since F n 
converges F as n —> oo, we have X™(xo) converges to Xi as n —> oo. Therefore the closure of the 
subspace [C' 1 (Erf)] rf is [C(Y*d)] d which is a core of A, hence [C' 1 (Erf)] d is a core of A. 

□ 


B.2 A homogenous Markov chain M(t) modulated by a deterministic evolution 

X(t) 

Consider a Markov chain M(t),t £ [0,T] on with a switching function q on 

/ qn(x), ..., q ld (x) \ 


(x) = 


Qdi(x), qid(x ) 

V <7di0), ■■■, (Idd(x) ) 


(B.3) 


Let the Markov chain M(t),t £ [0,T] be modulated by a deterministic evolution X(t),t £ [0,£] 
which is described by the first order ordinary differential equation 


X t = F(X t 


(B.4) 


with a given initial xo £ E^ and F : T, d —> R d . Then the modulated Markov chain is a Feller 
process on [C(T, d x L^)] d , which is denoted by (X(t),M(t)),t £ [0,T] and is described by 


h = Q(X t (x 0 ))h 

with a given initial function ho £ C(T, d x L^). The solution of (IB. 51) is 

ht(ho,x 0 ) = {il> t ho)(x 0 ) 


(B.5) 

(B.6) 


where i/dt is the semigroup of the Feller process (X(t),M(t)). 

Lemma B.2. If the functions F in (IB. 41) and q in (IB.31) are Lipschitz, then [C' 1 (E ( i x L^)] rf is a 
core for the generator of the semigroup ift in (IB. 61) . 


Proof. Following the proof for A.l, we construct a sequence of matrix-valued smooth functions 
with each entry qff- £ C 1 (E^) defined as 

Qii( x ) : = -y)4> n {y)dy, i,j = l,...,d 

where the mollifier (f) n is at least first order differentiable and with compact support, so that Q n 
converges to Q as n —> oo. The solutions to the approximating systems 

h = ® n (X?(x 0 ))h 
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are denoted by hf(xo,ho), where the sequence Xf(xo) is constructed in the proof for A.l. Since 
X”(x'o) —> Xt(xo) as n —> oo and Q is a Lipschitz function, we have Q n (X"(xo)) —> Q(A t (xo)) as 
n — > oo. Hence, hf(x o, ho) ht{x o, ho) as n — > oo. Therefore the closure of the space [C ,1 (SrfxLrf)] d 
is [C(T,d x Lrf)] rf , hence [C ,1 (Srf x Lrf)] d is a core the generator of the semigroup 'i/’t in (1B.6|) . □ 


B.3 A non-homogenous Markov chain M(t) modulated by a deterministic evo¬ 
lution X(t) 

Consider a time non-homogenous Markov chain M(t),t £ [0,T] on with a switching function Q 
on [0, T] x Y, d 



/ qu(t,x), . 

Qi d(t,x) 

\ 

(t,x) ->• Q(t,x) = 

qn(t,x), . 

■ qid{t,x) 



V Qdi(t,x), . 

.., q d (t,x) 

/ 


(B.7) 


Let the time non-homogeneous Markov chain M(t),t £ [0, T] be modulated by a deterministic 
evolution X(t) which is described by 

X t = F(t,X t ) (B.8) 

with a given initial xo £ R (i! and F : [0,T] x —y R d . Then the modulated non-homogenous 
Markov chain is a Feller process on [C(T, d x L^)]^ which is denoted by (X(t),M(t)),t £ [0,T] and 
is described by 


h = Q(t, X t (x 0 ))h (B.9) 

with a given initial function ho £ C(T, d x L^). The solution of (IB. 51) is 

h t {h 0 ,xo) = ('i/j t ’ 0 h 0 )(xo) (B.10) 

where 'il> t,s , 0 < s < t, is the two-parameter semigroup of the Feller process (X(t), M(t)). 

Lemma B.3. If the function F in (IB. 81) and q in (IB.71) are Lipschitz in both x and t, then 
[C 1 ^ x h d )] d is a core for the generator of in (IB.101) . 

Proof. Set y = (x,t) £ x [0, T], Then Markov chain M(t) is governed by the switching function 
q(y) and the system (IB. 81) is translated to 


X t = F(y) 
s = 1 

with a initial data (xo, 0). Then a direct application of the result in the appendix [B/2] complete the 
proof. □ 

C e equilibrium under a general payoff function J 

This appendix states that, for an inspection game with a general payoff function J of inspectees, 
any optimal q* derived from the corresponding mean field inspection game model is an approximate 
Nash for any inspectee. 
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Consider all N inspectees aim to maximise a general payoff function J as a function of t,l,q 
and x. Let X A = {X N (t),t G [0,T]} be the distribution evolution of the N interacting inspectees 
among d states in L^. Let X = {X(t),t G [0,T]} be a solution to the mean field inspection game 
and q* be the resulting optimal switching strategy for a representative inspectee. If X N X and 
the payoff function J is Lipschitz uniformly in x, then q* is an e equilibrium for an inspection games 
with any finite N inspectees, namely for any (t,lf € [0,T] x L^, 

J(t,k,q*,X N ) > J(t,k,q,X N ) - e 

where e = e(N, q*, q) —> 0 as N —> oo. 

Proof. Since X N —> X and J is Lipschitz uniformly in x, for any (t, If) G [0, T ] xL^ and an switching 
function q, we have 

lirn J(t,li,q,X N ) = J(t,k,q,X). 

TV—>oo 

Since inspectees aim to maximise their payoffs and in the limit N —> oo, q* is the optimal 
strategy, for any (t, If) G [0, T] x we have 

J(t,k,q*,X ) > J(t,k,q,X) 

for any q. Therefore, for any (t, If G [0, T] x and for N big enough, there exists an e = 
e(q*,q,N) > 0 so that 

J(t, k,q*,X N ) = J(t, k, q*,X) + J(t, k,q*,X N ) - J(t, k, q*, X) 

= J(t,li,q*,X)±e(q*,N) 

>J(t,U,q,X)±e(q*,N ) 

= J(t,li,q,X N ) + J(t,k,q,X) - J(t,li,q,X N ) ±e(q*,N) 

= J(t,k,q,X N ) ± e(q,N) ± e(q*,N) 

> J(t,k,q,X N ) - e{q*,q,N ) 

with e = e(q*,q, N) —> 0 as N —> oo. □ 
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